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Height-related Gene 

The sex-related height difference in humans is thought to be caused mainly by two 
components: first, a hormonal component determined by the sex dimorphism of bioactive 
gonadal steroids and second, a genetic component attributed to a Y-specific growth gene, 
termed GCY (Tanner, et al. 1966; Smith, et al. 1985; Ogata and Matsuo, 1992). Despite 
extensive mapping attempts for this gene on the human Y chromosome (Ogata, et al. 
1995, Salo, et al. 1995, Rousseaux-Prevost, et al. 1996, De Rosa, et al. 1997), its precise 
position remains unknown. Recent evidence shows that inappropriate cytogenetic 
methodology in the characterization of Y-chromosomal terminal deletions has brought 
about some of the difficulties in elucidating the GCY-critical region. In order to overcome 
these problems, the inventors have considered only patients presenting de novo interstitial 
deletions for the GCY analysis on the Y chromosome (Kirsch, et al. 2000). This approach 
allows the assignment of GCY to a particular chromosomal interval without excluding the 
presence of XO-mosaicism and/or i(Yp) and idic(Yqll) chromosomes in patients with 
terminal deletions. 

The direct comparison of overlapping interstitial deletions in seven adult males with 
normal height, one male with borderline height, and one patient with a large interstitial 
deletion and short stature resulted in the confirmation of the GCY critical interval between 
markers DYZ3 and DYS11. This region roughly encompasses 1.6-1.7 Mb of genomic 
DNA. To improve the resolution in the region of interest close to the centromere, the 
inventors have established additional new STS markers specific for this part of the 
chromosome using our bacterial artificial chromosome (BAC)/P1 -derived artificial 
chromosome (PAC) contig. Molecular deletion analysis using these new Y-chromosomal 
STSs allowed the inventors to narrow down the critical interval to a genomic region of 700 
kb. 

Preferably the regions are to the exclusion of the regions of chromosomes on each side of 
the defined regions. 
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Preferably the region is between SKY1 and sY83. It may include one or both the SKY1 
and the sY83 regions. Preferably the region is between SKY8 and sY83 (preferably 
includes one or both of the SKY8 and sY83 regions), or SKY1 and SKY4. 

The invention provides an isolated region of the Y chromosome between DYZ3 and 
DYS11 which encompasses GCY. Preferably the Y chromosome is a human Y 
chromosome. 

The preferred region is between sY79 and sY81, preferably to the exclusion of the region 
of the Y chromosome outside that area of the chromosome. 

Primers for use in GCY studies are also provided. 

The invention further provides isolated gene/pseudogene sequences which contributes the 
sex related height difference in humans. These may be one or more of the gene or 
pseudogene sequences identified in one or more of the figures. 

The invention further encompasses proteins having the same function as GCY protein and 
which have greater than 65% homology, greater than 70% homology, greater than 75% 
homology, greater than 80% homology, greater than 85% homology, preferably greater 
than 90% homology, and most preferably greater than 95% homology to the GCY protein. 
Preferably this has GCY gene activity, for example it has an effect on the height of a male 
mammal when expressed in that mammal. 

Primers for use in detecting or amplifying a region of GCY are also provided. They may 
be labelled using radioactive or non-radioactive labels known in the art and used using well 
known methods. These methods include PCR, Southern or Northern blottipg. 

Experimental evidence will now be described in detail with reference to the figures in 
which: 

Table 1 is a comparison of the adult height of patients and their siblings. 
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Table 2 is a table of new Y chromosomal STSs 

Table 3 is the PCR/restriction digest analysis of sequence family variants in the AZFc 
region 

Table 4 is a summary of BAC and PAC clones identified during physical map creation. 

Table 5 is a summary of the genomic primers that will be used for microdeletion screening 
in adult males with idiopathic short stature. 

Table 6 is a summary of the sequences of the isolated exon trap clones 

Table 7A is a summary of primer pairs for predicted genes, 

7B is a summary of primer pairs specific for the Y-copy of Adlican (ADLY), 
7C is summary of RT-PCR primer sequences for ADLY, 

Table 8 is RT-PCR primer sequences for exon trap clones, 

Tables 9a & b are tables showing homology of exons between ADLX and ADLY. 

Table 10 is a summary of sequence divergence of genes/pseudogenes from the GC Y region 
and their homology. 

Figure 1. Deletion mapping on the long arm of the human Y chromosome. 

A diagram of the human Y chromosome with Yp telomere to the left and Yq telomere to 
the right is presented at the top. Shown below are the results of low-resolution analysis of 
Y-chromosornes of adult males with normal height or short stature. Along the top border, 
95 Y-chromosomal STSs are listed. Except for SKY3 and SKY8 (see Table 2 for detail), 
all other STSs were previously reported (Vollrath et al., 1992, Jones et al., 1994, Reijo et 
al., 1995). Blank spaces or grey boxes indicate inferred absence or presence of markers for 
which assay was not performed. Asterisks indicate markers in the respective breakpoint 
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regions which could not be tested. In all cases where previously published data of the 
patients were re-investigated, the identical DNA sample used for the primary analysis was 
studied. (Please note that the proximal as well as the distal breakpoint of the interstitial 
deletion of patient #293 resides within satellite type II sequences.) 

Figure 2. Sequence family variant (SFV) typing in the human DAZ locus in distal 
Yqll.23. 

A. Overview and amplicon structure of the human Y chromosome in the vicinity of the 
human DAZ cluster. Each amplicon is represented by specific bands (A, B, D, E, X). 
Shown above are arrows indicating the orientation of each member of an amplicon family 
with respect to each other. The amplicon indicated by bands X arose from a portion of 
chromosome 1 that was transposed to the distal end of the DAZ cluster and partially 
duplicated. 

B- Precise position of selected Y-specific STSs and the SFVs according to the physical 
map of the human Y chromosome. Marker sY157 is highlighted as it was suspected to be 
present in only one copy by multiplex PCR analysis (see text for detail). 

C. Summary of STS and SFV analysis in patients with Y- chromosomal rearrangements 
within the human DAZ cluster region. Grey boxes indicate inferred absence or presence of 
markers. 

D. Sequence family variant typing of SKY10 and SKY12 in genomic DNA of patient 
#1972. Assay is described in Table 3. Along the right are listed fragment sizes (in bp). 
Products are separated by electrophoresis in 3% NuSieve agarose (3:1) and visualized by 
ethidium bromide staining. 

Figure 3. Schematic representation of the organization of the long arm 

pericentromeric region of the human Y chromosome 

A. Diagram showing the distribution of major tandem repeat blocks and general 
organization of sequence homologies. Basically, the region can be subdivided in three 
distinct intervals: a proximal region characterized by 5bp satellite sequences (G), a central 
region with high homology to chromosome 1 (O), and a distal region composed of 
X/Y-homologous sequences (B). Below the precise position of the newly established and 
previously published STS markers in this region are illustrated. At the bottom border, the 
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PAC/BAC contig constructed with the aid of the new STS markers is shown. Prefixes RP1, 
5 indicate PAC clones and RP1 1 BAC clones, respectively. 

B. Localization of the GCY critical interval as defined by high-resolution STS mapping in 
patients with short stature and normal height Black boxes indicate the presence, white 
boxes the absence of the respective STS. Striped boxes depict the dosage unknown regions 
where the breakpoint resides. 

Figure 4. Molecular characterization of the GCY critical region a. Schematic illustration of 
the deletions in the two most crucial patients. SKY1 and sY83 demarcate its boundaries 
because clone Y0308 was found to have a different deletion (see Fig. 3) marking SKY1 as 
one of its boundaries. The AZFa region distally adjacent to the GCY region is indicated, b. 
Structural compartmentalization in three segments with distinct homologies. The segment 
composed of 5bp repeats is shown in green, the segment homologous to chromosomal 
subinterval lq43 in orange, and the segment homologous to Xp22 in blue, c Detailed 
description of annotated BAC clones sourcing the genomic sequence of the GCY region. <L 
Precise positioning of PAC clones used as substrates for exon amplification, e. Location of 
all exon trap clones. Due to its small size and limited single-copy content contained within 
exon trap clone etal was not amenable for further experimental analyses f. Documentation 
of all in silico generated data sets in subsequent layers: gene models (orientation; 
exon/intron structure) - apparent pseudogenes (exon/intron structure; orientation) - 
promoters. Orientation of gene models can be deduced by colour (red: orientation towards 
the centromere; blue: orientation towards the telomere). Please note that the chromosomal 
region covered by CITB-144J01, CITB-298B15, and CITB-203M13 was already 
intensively studied in Sargent et al. 1999. 

Figure 5. Homology comparisons between genes/pseudogenes of the GCY region and their 
functional progenitors. Precise location of the Y-chromosomal copies is indicated. Gene 
pair-specific homology and subchromosomal location of the actual structural gene is shown 
in blue. 

Figure 6. Evolutionary history of KIAA1470. On the left, chromosomal movements are 
illustrated. The upper lateral bar shows the exon/intron structure of the functional 
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progenitor in lp36. Successive degenerating events have shaped KIAA1470 into the two 
pseudogenes on lq43 and Yqll. Both copies on lq43 and Yqll share a 98% nucleotide 
sequence homology with each other, the highest among the 12 retroposons of the 
KIAA1470 gene family. They show 77% and 79% homology to the master gene. 

Figure 7. Comparison of the structural features of the X- and Y-specific adlican 
gene/pseudogene. The coding exons of ADLX are illustrated as boxes. Corresponding 
putative exons of ADLY (also presented as boxes) were identified by homology searches. 
Major rearrangements in the putative transcriptional unit of ADLY are highlighted as black 
triangles. Sizes of mRNAs and ORF are presented for ADLX and ADLY. Primers used in 
RT-PCR assays are shown at their respective locations. Identical colouring above and 
below the separation line indicate non-selectivity for both transcripts. RT-PCR primers 
exclusively presented below the line are specific for ADLY. 

Materials and Methods 

Defining the GCY critical region 

Selection of patients 

Patients #293, JOLAR, #28, #63 and #95 have been described clinically in detail elsewhere 
(Skare et al. 1990; Ma et al. 1993; Foresta et al. 1998; Kleiman et al. 1999). Patient Y0308 
corresponds to case 1 in the study of Pryor et al. 1997. Patients T.M., #1947 and #1972 are 
phenotypically normal males suffering from idiopathic infertility. Genomic DNA samples 
were extracted from peripheral blood leukocytes (#28, #63, #95, Y0308, T.M., #1947, 
#1972) or from lymphoblastoid cell lines (#293, JOLAR). DNA isolated from peripheral 
blood leukocytes of normal males and females served as internal controls. 

Height assessment 

As all individuals are of diverse ethnic origins, height was compared to the respective 
national height standards (Table 1). Patients were of similar age range. When possible, 
special attention was given to adult height comparisons between parents and siblings. Data 
are summarized along with the height standard deviation score (SDS) in Table 1. To 
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calculate the SDS, mean adult height and the standard deviation were taken from the 
corresponding national physical growth studies. 



PCR analysis 

Reactions were performed in a total volume of 50^1 (75mM Tris/HCl pH9.0, 20mM (NHi)2 
S0 4 , 0.1%(w/v) Tween20, 1.5mM MgCl 2 ) containing l.OmM of each oligonucleotide 
primer, lOOng genomic DNA as template, 5 units of Taq DNA polymerase (Eurogentec), 
and each dNTP at ImM in a thermocycler (MJ Research, Inc.) as follows: After an initial 
denaturation step of 95°C for 5min, samples were subjected to 30 cycles consisting of 
30sec at 94°C, 30sec at 60°C and lmin at 72°C followed by a final extension step of 5min 
at 72°C. The Multiplex PCR was carried out as described in Henegariu et al. 1994 with 
minor modifications. Alu-Alu PCR reactions were essentially carried out as described in 
Nelson et al. 1991. Amplification products smaller than 1 kb were resolved on 3% NuSieve 
agarose/1 %SeaKem GTG agarose (FMC) in 1 x TBE (0.089 M Tris-borate/0.089 M boric 
acid/20mM EDTA, pH 8.0). For amplification products larger than 1 kb as well as products 
from Alu-Alu-PCR 9 1 .5% SeaKem GTG agarose gels in 1 x TBE were used for separation. 

PCR primers 

Y-specific STSs, loci and PCR conditions have been described previously (Vollrath et al. 
1992; Jones et al. 1994; Reijo et al. 1995). Sequences of new Y-chromosomal STSs are 
listed in Table 2. Y-specific STSs termed SKY were either derived from YAC, BAC and 
PAC end sequences or from clone-internal sequences amplified by various combinations of 
Alu primers. Primers for the markers SKY10, 11, 12, and 13 were designed to amplify 
fragments spanning unique restriction sites within the genomic DAZ locus (SKY10 from 
RP11-487K20 (AC024067), RP11-70G12 (AC006983), RP11-141N04 (AC008272), 
RP11-366C06 (AC015973), RP11-560I18 (AC053522), RP11-175B09 (AL359453), 
SKY11 and SKY12 from RP11-245K04 (AC007965), RP11-100J21 (AC017005), 
RP11-506M09 (AC016752), RP11-589P14 (AC025246) and SKY13 from RP11-100J21 
(AC017005), RP11-589P14 (AC025246), RP11-823D08 (AC073649), RP11-251M08 
(AC010682), RP11-978G18 (AC073893)) in order to detect 'sequence family variants 6 
(SFVs). 
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Restriction analysis of PCR products 

PCR products were resolved on agarose gels, the appropriate gel bands cut out and the 
DNA isolated with GFX™ PCR DNA and Gel Band Purification Kit 
(Amersham Pharmacia Biotech, Lac.) according to the manufacturer's protocol. Fragments 
amplified from SKY5 and SKY6 were digested with TaqI and BsmI, respectively. To 
detect SFVs at SKY10, SKY11, SKY12 and SKY13, PCR products were digested with 
restriction enzymes as listed in Table 3. 

Se quencing of BAC/PAC/YAC end fragments 

DNA from BAC/PAC clones selected for end sequencing were purified with the 
Nucleobond PC100 Kit (Macherey-Nagel) according to the manufacturer's instructions. 
End fragments were directly sequenced using the Thermosequenase Fluorescent Labelled 
Primer Cycle Sequencing Kit (Pharmacia) and analyzed on a Pharmacia A1.F. express 
(Amersham Pharmacia Biotech). YAC end fragments were generated with 
Alu/Vector-polymerase chain reaction and subcloned in pCR2.1 with the TOPO-TA 
cloning Kit (Invitrogen). Sequencing was performed as described. 



Fluorescence in situ h ybridization 

Metaphase spreads were obtained either from primary blood samples or immortalized cell 
lines. Preparations were made according to standard protocols (Lichter and Cremer 1992). 
Cosmid and plasmid DNA was labeled by nick translation with biotin-16-dUTP (La 
Roche). Slides carrying metaphase spreads were kept in 70% ethanol at 4°C for one week. 
200-300ng of labeled plasmid or cosmid DNA, 20-30ug of human Cot-1 DNA (GIBCO 
BRL), and hybridization buffer (50% formamide, 10% dextran sulfate, and 2 x SSC, pH 
7.0) were mixed, denatured for 5min at 75°C and pre-annealed for 30min at 37°C. The 
slides were denatured for 2 min in 70% formamide and 2 x SSC, pH7.0, at 72°C (Ried et 
al. 1992). The pre-annealed probe was hybridized overnight in a humidifying chamber at 
37°C. Slides were washed and stained with avidin-conj ugated fluorescein isothiocyanate 
(FITC). The signal was amplified with biotinylated anti-avidin followed by staining with 
avidin-FTTC. For the probe all human telomeres (Oncor) the instructions supplied by the 
manufacturer were followed. Chromosomes were counterstained with 
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4 , 3 6-diamidino-2-phenyliiidol dihydrochloride (DAPI). Images were taken separately by 
using a cooled charge coupled device camera system (Photometries, Tucson AZ, USA). A 
Macintosh Quadra 900 was used for camera control and digital image acquisition in the 
TIF 1 format using the software package Nu200 2.0 (Photometries). Separate gray scale 
fluorescence images were recorded for each fluorochrome. Images were overlaid 
electronically and further processed using the Adobe Photoshop software. 

Searching the stature gene 
Microdeletion screening 

Exon amplification 

Shotgun subcloning of PAC clones into pSPL3B. Genomic DNA from chromosome Y 
specific PAC clones was partially digested with Sau3AI. lOOng of isolated fragments in the 
range of 4-10Kb were ligated with lOOng of pSPL3B that had been BamHI digested and 
dephosphorylated. The ligation reaction was transformed into supercompetent E.coli Xl-1 
blue cells (Stratagene) and aliquots of each transformation plated on selective medium 
(ampicillin). Resulting colonies were subsequently pooled for plasmid DNA isolation. 

Cell culture and electroporation. COS7 cells were propagated in DME medium 
supplemented with 10% heat inactivated calf serum. For transfections COS7 cells in 
between the 5 th and 15 th passage were grown to about 75% confluence, trypsinized, 
collected by centrifiigation and washed in ice-cold Dulbecco's PBS. 4x1 0 9 cells were then 
resuspended in cold 0.7ml Dulbecco's PBS and combined in a precooled electroporation 
cuvette (0.4cm chamber, BioRad) with 0.1ml Dulbecco's PBS.containing 15|ig DNA. After 
lOmih on ice, cells were gently resuspended, electroporated (1.2kV, 25jnf) in a BioRad 
Gene Pulser 2 and placed on ice again. After lOmin cells were transferred to a tissue 
culture dish (lOOmm) containing 10ml prewarmed, C0 2 preequilibrated culture medium. 
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RNA isolation, RT-PCR and cloning. Cytoplasmic RNA was isolated 72hrs post 
transfection (QIAGEN RNeasy Kit) and first strand synthesis was performed as 
recommended by the manufacturer with minor modifications: 5\ig of RNA was added to a 
solution containing lOmM of each dNTP and 2jiM of oligonucleotide SA2. The mixture 
was heated to 65°C for 5min and then placed on ice for at least a further minute. After 
adding a reaction mixture containing lOx PCR buffer (Perkin-Elmer Cetus), 25mM MgCl 2 , 
0.1M DTT and RNAsin (35U/jil) , the reverse transcription reaction was transferred to 
42°C for 2min. Ipl of Superscript II RT (200U/jil; Gibco BRL) was then added and the 
reaction incubated at 42°C for 90min and 50°C for 30min. The entire cDNA synthesis 
reaction was then converted to double strand DNA using a limited number of PCR 
amplification cycles in the following 100|xl reaction mixture: lx PCR buffer (Perkin-Elmer 
Cetus), 1.5mM MgCfe, 200nM dNTPs, l|xM SA2, IfiM SD6 and 2.5U Taq polymerase 
(Perkin-Elmer Cetus). 6 amplification cycles were used and consisted of lmin at 94°C, 
lmin at 60°C and 5min at 72°C. To eliminate vector-only and false positive products, SOU 
of BstXI (New England Biolabs) was added directly to the reactions, followed by overnight 
incubation at 55°C. 

10pl of the digest was then used in a second PCR amplification using internal primers in 
the following lOOjxl reaction mixture: lxPCR buffer (Perkin-Elmer Cetus), 1.5mM MgCb, 
200|jM dNTPs, ljiM (CAU) 4 -SD2, lpM (CUA) 4 -SA4 and 2.5U Taq polymerase 
(Perkin-Elmer Cetus). 25 amplification cycles were used and consisted of lmin at 94°C, 
lmin at 60°C and 3min at 72°C. Products were separated by electrophoresis and fragments 
larger than the pure SD2/SA4 RT-PCR product excised and subcloned (CloneAmp pAMPl 
System; Gibco BRL) into pAMPl according to the manufacturer's protocol. Ligation 
reactions were then transformed in ultracompetent E.coli XL-2 blue (Stratagene) and plated 
on selective medium containing X-Gal/IPTG. 

Identification of candidate exons. All white colonies were picked and transferred to 
384-well microliter plates containing selective medium and incubated overnight at 37°C. 
With a 384-pin transfer device 24.5x24.5cm culture plates with and without positively 
charged nylon membranes (Amersham) on top of them were inoculated and also incubated 
overnight at 37°C. Colonies grown on culture plates were pooled for plasmid preparation, 
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colonies on nylon membranes were used for colony lifts. Plasmid inserts were excised, 
purified, and hybridized to nylon membranes containing EcoRI-digests of the PAC clones 
used as the original substrate- Highlighting bands were subsequently isolated and 
hybridized to colony lifts to identify candidate exons. Candidate exons were isolated and 
sequenced by Sequitherm EXCEL ft DNA Sequencing Kit (Epicentre Technologies). 
Sequences were automatically analyzed and read on an ALFExpress DNA sequencer. Table 
6 lists the sequences of the isolated exon trap clones. 

Exon Trapping. DNA from chromosome Y specific PAC (Pl-derived artificial 
chromosome) clones RP1-148J07, RP5-U60A12, RP1-301P22, RP4-532I07 and 
RP1-114A11 was partially digested with Sau3AI and fragments in the range of 4- 10Kb 
were individually subcloned into pSPL3B. COS7 cells were transfected and after 72hrs 
cytoplasmic RNA was harvested using QIAGEN RNeasy Kit cDNA synthesis was 
performed as recommended by the manufacturer (Gibco-BRL). Primers flanking the 
cloning sites were used to identify products larger than the pure SD2/SA4 RT-PCR 
product. These fragments were excised, subcloned (CloneAmp pAMPl System; Gibco 
BRL) into pAMPl and sequenced. Exon trap clones were labelled with 32 P-dCTP by 
random priming and used as hybridization probes on Southern blots. Hybridization: 16 hrs 
at 65°C in standard hybridization buffer (Singh and Jones 1984). Wash: three times for 20 
min each at 65°C in O.lxSSC, 0.1% SDS. 

In sUico gene prediction. Completed genomic sequences from BAC clones RP11-75F05, 
RP11-461H06, RP11-333E09, RP11-558M10, CJTB-298B15 and CJTB-144J01 were 
analyzed for homologies to known genes and virtual gene content using the NEC 
(http://menu.hgmp.mrc.ac.vik) and Rummage (http://genl00.imb-jena.de) software 
packages. Computational identification of promoters and first exons was achieved by 
submitting BAC sequences to FirstEF (http://www.cshl.org/mzhanglab). 

Reverse-transcribed polyA*-RNAs and cDNA libraries. Human polyA + -RNA of 16 fetal 
and adult tissues was purchased either from Clontech or Invitrogen. Human polyA + -RNAs 
from 3 osteosarcoma and 1 bone marrow fibroblast cell line were isolated by the QIAGEN 
Oligotex kit. First-strand cDNA synthesis was essentially carried out as described (Rao et 
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al. 1997). Fourteen cDNA libraries were obtained either from Clontech or Stratagene. A 
collection of 40 cDNA libraries was also provided by the Resource Center of the German 
Human Genome Project (RZPD). The complete list is available on request 

Characterization of potential transcription units. After homology comparison and open 
reading frame (ORF) analysis of exon trap clones, primers were designed for RT-PCR 
amplification. Sequences are summarized in Table 8. In those cases where exon trap clones 
consisted of only one exon, two exon-specific primers were combined with cDNA-library 
specific primers in semi-nested PCR. Primers were designed from predicted gene models to 
amplify across exon/intron boundaries. To provide evidence of transcription, primers were 
used to screen a panel of cDNA libraries and polyA + -RNAs (see above). In the case of 
potential coamplification from homologous transcripts, primers flanked Y-specific 
restriction sites. 

Evolutionary strata classification. Sequence divergence between genes/pseudogenes of the 
GCY region and their functional/non-functional progenitors was determined according to 
Li, 1993. Sequences for all pseudogenes were extracted from genomic sequences: 
KIAA1470PY from BAC clone RP11-75F05 (AC011293), KIAA1470P1 from BAC clone 
RP11-498M14 (AL445675), ADLY from BAC clone RP11-333E09 (AC011302), ARSFP 
and RPS24P1 from BAC clone CITB-144J01 (AC004772), RPS24PX from BAC clone 
RP11-418N20 (AC1 19620), ASSP6 from BAC clone RP11-461H06 (AC012502) and 
ASSP4 from BAC clone GS1-536K07 (AC004616). Sequences for all other genes were 
obtained from published cDNAs, whose GenBank accesion numbers are as follows: ADLX 
(AF245505), ARSF (XM 035467), RPS24 (NM_033022), ASS (X01630), KIAA1470 
(AB040903). THC604695PY was not analyzed as only part of its most terminal exon 
(consisting almost entirely of 3'UTR) was available for comparison with the 
X-chromosomal EST cluster (AA662182 and AA662138). 
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Results 

Mapping of interstitial deletions 

We studied the DNA of nine adult males which originally consulted reproduction centers 
about idiopathic infertility, but were otherwise generally healthy. Of the 9 males, 7 were 
unremarkable with respect to adult heigjit One patient, #293, with a height of 157cm, 
presented short stature (SDS -2.9) and one, Y0308, with a height of 165,5cm showed 
borderline height, being at the 3 rd percentile of normal U.S. height standard (SDS -1.7). 
Adult height of his parents and siblings are in the normal range (Table 1), his brother being 
20.5cm taller than the patient Compared to his target height (178cm) and target range 
(169-1 87cm) he can be considered short All men were ascertained solely on the basis of 
the occurrence of large de novo interstitial deletions on the Y chromosome. Only two of 
those patients had undergone previous chromosomal studies. 

In our effort to localize the GCY locus, we focused on that part of the Y chromosome long 
arm, which was delimited by the boundaries of the interstitial deletions of the patients with 
short stature (Fig. 1). Recently, a detailed physical map of the human Y chromosome 
incorporating 758 ordered STSs and 199 completely sequenced BAC clones has been 
constructed (Tilford et al. 2001). We used a slightly modified PCR multiplex system 
(Henegariu et al. 1994) to test the absence or presence of 28 DNA loci from the Y 
chromosome long arm. In patients where sufficient DNA was available for further PCR 
analysis additional STSs were tested. As a result, 8 of 9 interstitial deletion breakpoints 
could be positioned (Fig. 1). As the deletions of patients JOLAR, #28, #63, #95, T.M., and 
#1947, all with normal height, overlap, most of the long arm of the Y chromosome could 
be excluded as a critical region for GCY. 

As the distal breakpoint of the deletion of patient #1972 does not reside within the specific 
part of the Y chromosome long arm, the nature of the deletion (terminal or interstitial) 
remained unclear. There was also no overlap of his deletion with the deletions of patients 
#1947 and T.M. Relying solely on the results obtained by the STS-based interstitial 
deletion mapping strategy, one could not formally exclude the region distal to sY158 as a 
potential critical region for GCY. However, multiplex PCR analysis always showed a less 



WO 03/091381 




PCT7EP03/04546 



intense amplification product for STS sY157 (a Y-derived marker in close vicinity of 
sY158). To address this problem, the rearranged Y chromosome of patient #1972 was 
investigated in more detail. 



Fluorescence in sita hybridization and sequence family variant typing of patient 
#1972 

The overall integrity of the Y chromosome from patient #1972 was demonstrated by FISH 
of the cosmids LLOYNC03 , M"34F05 (PARI) and IXOYNC03"M"49B02 (PARI) as well 
as the Y-centromere-specific probe Y-97 and the telomere-specific probe c all human 
telomeres' (data not shown). Being aware of the complex structural organization of the 
human DAZ locus (Fig. 2A), we specifically searched for sequence family variants (SFVs). 
To prevent misjudging sequence errors as single nucleotide differences, 
PCR/restriction-digestion assays were developed only from SFVs present in at least two 
overlapping BAC clones. The localization of these SFVs is shown in Fig. 2B. As these 
SFVs could represent allelic variants, ten unrelated normal German males were typed. In 
all cases, the expected fragment pattern could be detected for the Y-chromosome derived 
sequences. In contrast, the fragment pattern deduced from the genomic sequence of the 
chromosome 1-derived BAC clone RP1 1-560118 could not be confirmed (see Table 3 for 
detail). Each SFV-specific PCR/restriction digestion was compared to the presence/absence 
in the corresponding BAC clones. 

Typing the genomic DNA of patient #1972 for all four sequence family variants 
(SKY10/Tsp509I, SKYll/Nlaffl, SKY12/MseI, and SKY13/Cac8I + Tfil) revealed the 
absence of one Y-derived non-allelic sequence variant (Table 3 and Fig. 2C,D). In the case 
of SKY10 the distal copy is deleted. Not surprisingly, in all other typing experiments the 
more proximal copy of the respective SFVs was shown to be deleted. 

Next, we investigated these SFVs in the two patients with the most distal breakpoints (#95 
and #1947). Using genomic DNAs, we determined that both non-allelic variants of SKY1 1, 
SKY12, and SKY13 and one non-allelic variant of SKY10 were absent in patient #1947, 
whereas for all tested SFVs one non-allelic variant was absent in patient #95. 
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Taken together, these results provide evidence that the proximal breakpoint of the 
interstitial deletion present in the Y chromosome of patient #1972 resides within the 
interstitial deletion of patient #1947, thereby excluding this genomic region as a potential 
critical interval for GCY. 

Refinement of the GCY critical interval 

Based on the molecular analysis of the pericentric region of the long arm of the human Y 
chromosome (Williams and Tyler-Smith 1997), the physical extension of the GCY critical 
region as defined by the markers sY78 (DYZ3) and sY83 (DYS11) was estimated to 
constitute 1.6-1.7 Mb (Fig. 3 A) of DNA. The most proximal 400 kb of this region consist 
exclusively of 5bp satellite sequences separated from the Y centromere only by AIu 
sequences. This constant part of the human Y chromosome is therefore unlikely to contain 
coding sequences. The remainder of the GCY critical region is composed of 
X/Y-homologous as well as autosomal/Y-homologous sequence blocks. At the onset of this 
study, only limited coverage in YAC clones was available for this region. In order to refine 
the GCY critical interval and to generate gene finding substrates, it was necessary to 
establish a BAC/PAC-contig of this region. 

We generated 25 additional markers mainly by sequencing the end fragments of BAC, 
PAC, and YAC clones as well as clone-internal sequences amplified by various 
combinations of Alu-Alu oligonucleotide primer pairs. Of those, only 7 turned out to be 
Y-specific (SKY1, SKY2, and SKY4-8) (see Table 2 for detail). The BAC and PAC clones 
identified during the generation of the physical map are summarized in Table 4. 
Meanwhile, some of these clones have been completely sequenced as they form part of a 
tiling path for sequencing the human Y chromosome (Tilford et al. 2001). The proximal 
part of the cloned region between markers sY78 and SKY6 has not been sequenced to date. 
A selection of clones covering the entire GCY critical region is depicted in Fig. 3. 



Confirming the overlap between BAC RP11-295P22 and BAC RP11-322K23 appeared to 
be the most crucial step in the process of contig construction. Y-specific markers derived 
from the opposite end fragments of both clones were suspected to amplify identical-sized 
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fragments from two different loci within the same 5bp satellite region. By testing several 
restriction enzymes known to cut frequently within 5bp satellites composed of the 
consensus sequence (TGGAAk we developed loci-specific PCR/restriction digestion 
assays. Typing all BAC clones mapping to this sequence block with the appropriate 
PCR/restriction digestion assay allowed us to precisely position them thereby confirming 
their overlaps. 

In order to narrow down the critical interval for the GCY gene, we tested for the presence 
of the newly generated STS in patients #293, Y0308, and JOLAR. These results allowed us 
to define a small region for the GCY gene (Fig. 3 and Fig. 4). Direct sequence comparison 
showed that the sequenced BAC clones RP11-322K23, RP11-75F05, RP11-461H06, 
RP11-333E09, RP11-558M10, CITB-298B15, and CITB-203M13 completely cover the 
mapped region between Y-STSs SKY8 and sY83 (DYS11), suggesting that it encompasses 
roughly 700 kb. Basically, the region can be subdivided in three distinct intervals: a 
proximal region characterized by 5bp repeats, a central region with high homology to 
chromosome 1, and a distal region composed of X/Y-homologous sequences. As the most 
distal part of the GCY critical region (beginning with bpl of BAC clone CITB-144J01) 
was already subject of extensive research during the process of characterization of the 
AZFa critical region and was shown to harbour no functional gene (Sargent, et al. 1999), it 
was excluded from further detailed genomic DNA analysis. The most proximal part of the 
GCY critical region consists exclusively of satellite type 3 sequences of the 5bp consensus 
(TGGAA) n and is therefore also not assumed to contain any gene. Leaving these two 
regions out of consideration, we were able to concentrate our efforts to a smaller interval of 
420 kb of DNA. Large-scale sequence comparisons performed by the Advanced PipMaker 
software showed no integration of Y-specific sequences into the chromosome 1 and/or 
chromosome X-homologous regions. 

We have also established new Y-specific markers scattered uniformly across the entire 
420Kb of DNA (Tab 5). 
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Exon trapping in the GCY critical region. 

The boundaries of GCY region are defined by two deletion patients, JOLAR and Y0308 
(Fig. 3). PAC clone, RP1-148J07, extends into a genomic segment exclusively composed 
of 5bp repeats of the satellite 3 type. The very distal PAC clone, RP1-83D22, was not 
included in the experimental analysis, as the region distal to sY82 was previously analyzed 
in the course of defining the transcriptional potential of the AZFa region (Sargent et al. 
1999). To identify transcripts that might encode GCY, we used 5 PAC clones from the 
GCY region as substrates for exon trapping (RP1-148J07 up to RP1-1 14A1 1, Fig. 4). Each 
of the 5 PAC clones from the GCY region was individually subcloned and subjected to 
exon trapping. Nucleotide sequencing of trapped products identified 9 different exon trap 
clones, two of them were composed of two exons (Fig. 4, Tab. 6). All exon trap clones 
were isolated in several copies. Exon/intron boundaries of all 1 1 putative exons matches 
the splice site consensus. Trapped products that mapped to the GCY region were verified 
using PCR by their presence versus absence in males and females and GCY-deleted males 
with short stature. All exon trap clones revealed only one male-specific fragment on 
Southern blots. 



In silica analysis of annotated BAC clones. 

We analysed the genomic sequence of the complete GCY region using the gene prediction 
programs assembled by the NIX and Rummage software packages. Homologous sequences 
were also analysed in the non-redundant (nr) database of GenBank using the BLASTN or 
FASTA algorithm. BAC RP11-75F05, for example, includes a 1Kb segment with a 77% 
homology to the transcriptional unit KIAA1470 on chromosome lp36 (Fig. 5). On BAC 
RP11-461H06 and CITB-144J01, for example, sequences of 2.5 and 1Kb length showed a 
88% and 81% homology with the genes ASS and RPS24 on chromosome 9q34 and 10q22, 
respectively. The Y-chromosomal copies ASSP6 and RPS24P1, however, represent 
pseudogenes and have a progenitor on Xp22 that has been translocated to the Y 
chromosome. Two pseudogenes on RP11-333E09 and CITB-144J01, THC604695PY and 
ARSFP, represent deleted copies of Xp22 specific genes. 

BAC RP11-333E09 includes a deleted duplication (ADLY) of the adlican gene on 
chromosome Xp22 (ADLX). ADLX has been previously shown to be upregulated in 
osteoarthritic tissue and therefore likely plays a role in bone metabolism. The Y 
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chromosome copy, therefore, constitutes an important candidate for a gene involved in 
growth. Despite the loss of exons 3 and 4 as a consequence of intrachromosomal 
recombination, its basic structural organization (Fig. 7) and sequence homology to ADLX 
(Tab. 9a) could still allow to encode a functional protein with similar molecular properties. 
This observation was enforced by a unified predicted gene model of ADLY by all 
gene-finding programs (cfl; Fig. 4). Taking the functionality of the predicted ADLY 
promoter for granted and assuming ADLY would start at the ATG codon also used on the 
X chromosome, an in-frame stop codon at position +359 would result in premature 
termination. One additional promoter was predicted in the sense strand of the last intron of 
ADLY. There is, however, no obvious correlation between the promoter position and the 
significance for potential ADLY expression. 

Using various gene- finding programs we detected 17 gene models in the GCY region (Fig. 
4f). Only five (arl, cfl, crl) overlapped with transcriptional units identified by homology 
search. Conceptual translations of 14 models revealed no protein matches. With respect to 
location and orientation promoters predicted by FirstEF could be assigned to KIAA1470P, 
ADLY, RPS24P1, and ARSFP. 

In conclusion, there is no identity of exon trap clones and gene models/homologies or 
pseudogenes KIAA1470PY, ASSP6, and THC604695PY. Considering ADLY as the most 
attractive candidate for the GCY locus, we directly compared the exon/intron boundaries of 
the Y- and X-derived copy (Tab. 9b). Exons 3 and 4 of ADLX are deleted on the Y copy. 
The remaining 3 internal exons still possess correct 5' and 3' splice sites. 



Searching for a transcriptional unit 

Homology searches performed with all exon trap clones and predicted gene models against 
the dbEST segment of GenBank did not yield any Y-specific EST. PCR and 
PCR/restriction digestion assays with primers corresponding to all putative transcriptional 
units were carried out. Primers derived from all exons of ADLY (Tab. 7B, 7C), the most 
prominent GCY candidate, were used to screen reverse-transcribed polyA + -RNAs from 
osteosarcoma and bone marrow fibroblast cell lines. Whereas ADLX was shown to be 
expressed in all tested cell lines (with the exception of neuronal tissues), no ADLY specific 
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specific transcript was detectable. More extensive screening of polyA + -RNAs from various 
adult and fetal tissues basically led to the same result. We also tested all putative 
transcriptional units in the GCY region for expression in polyA + -RNAs from 21 tissues and 
49 cDNA libraries. RT-PCR assays did not provide proof of a transcribed gene. 

Evolutionary features of the GCY critical region. 

High sequence homology of the Y chromosome to other chromosomal regions is consistent 
with an evolutionarily recent transposition of those regions to the Y chromosome. More 
subtle nuances in synonymous nucleotide divergences of homologous gene pairs (K*) allow 
their integration into distinct evolutionary strata, group 1-4 (Lahn and Page 1999). The 
calculated K* values for all gene pairs in the GCY region along with Ks values from 
reference genes of the different stratas are given in table 6. We noted that the K«, values for 
all X-Y gene pairs can be grouped into the most recent evolutionary stratum (group 4), 
having been embarked on X-Y differentiation 30 to 50 million years ago. This 
classification is independent of the actual functional state of X-chromosomal genes. 
Comparing K* values between the Y-copies in the GCY region and their functional 
progenitors clearly demonstrates that decay of the X-chromosomal copies took place before 
the X-Y recombination occurred. Even more prominent is the difference between K« values 
for die chromosome 1 - chromosome Y gene pairs. The low K* value for the 
KIAA1470P1/KIAA1470PY gene pair points towards a very recent transposition to the 
human Y (Fig. 5). Supporting evidence comes from fluorescent in situ hybridization in 
primates delineating this event to a time period of about 5 to 6 million years ago (Wimmer 
et al. 2002). The K« value for the comparison of KIAA1470PY with its functional 
progenitor in lp36 date the underlying intrachromosomal transposition roughly to about 
150-170 million years ago. 

As the frequency of nonsynonymous substitutions (K.) is a function of both evolutionary 
time and selective constraints on the encoded proteins, the degree of constraint can be 
reflected in the ratio Ks/K* (Li, 1993): Values greater than one indicate the presence of 
constraints on both homologs, and values in the vicinity of one are consistent with lack of 
constraint on at least one homolog. All determined Ks/K* ratios suggest that natural 
selection on the Y copies is not ongoing thereby underlining their pseudogene status. 
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We searched the nr database of Genbank with the homology transitions and the distal 
border of the GCY region to precisely determine the physical extent of the homologous 
regions on chromosomal subintervals lq43 and Xp22. To identify highly conserved 
segments, we used Advanced PipMaker (Schwartz et al. 2000, http://bio.ces.psu.edu) for 
comparing the corresponding DNA. Inspection of the compound dot plot allows the 
identification of those portions of the GCY region absent in homologous sequences. As the 
overall homology of Y/l and Y/X in conserved regions is already in the range of 94-97% 
and 96-99%, putative protein-coding exons are not expected to show average percent 
identities higher than the non-coding environment Careful dot plot analysis showed that all 
novel sequences that have accumulated in the GCY region on the Y after the separation 
from its autosomal or X-chromosomal counterpart are exclusively of repetitive origin. 
Particularly evident is the prevailing preponderance of integrated LINEs family members. 

Discussion 

Since the issue on the existence of a Y-specific growth gene (GCY) was first raised, there 
have been several attempts to define its precise location. Whereas initial studies 
unanimously pointed towards a common region of the Y chromosome long arm (Salo et al. 
1995), more recent investigations have led to the identification of two non-overlapping 
critical intervals (Rousseaux-Prevost et al. 1996, Ogata et al. 1995, De Rosa et al. 1997). 
FISH analyses resolved this apparent contradiction by presenting clear evidence that the 
patient materials used in these, initial investigations contained 45 ,X0 cells and/or i(Yp) or 
idic (Yqll) chromosomes (Kirsch et al. 2000). Both genetic parameters influence the adult 
height of a given individual, thereby rendering it impossible to predict whether such 
patients have lost GCY or not Studies with patients carrying de novo interstitial deletions 
are, therefore, much better suited to address the problem of GCY localization. 

In the course of winnowing the literature for patients with small interstitial deletions, in 
particular close to the centromere, it became clear that those patients are very rare. This 
prompted us to extend our search for patients carrying large de novo interstitial deletions, 
irrespective of their actual adult height. We examined 9 adult patients, 7 of whom 
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presented normal height Futhermore, we could show overlapping deletions, thereby 



excluding GCY to reside between the Y-specific marker DYS1 1 and the pseudoautosomal 
region 2 (PAR2). Two patients, #293 and Y0308, presented interstitial deletions enabling 
the restriction of the GCY critical region to approximately 700 kb of DNA. This region is 
therefore predicted to harbour one or more genes required for normal human growth. 

Exon amplification and gene modeling in the GCY region. 

Although much attention has been drawn to the various azoospermia (AZF) critical regions 
in Yqll as well as Y-encoded testis-specific or ubiquitously expressed genes, the GCY 
region up to now was not searched systematically for transcription units. We have used 
exon amplification, homology search, and in silico gene prediction to identify putative 
genes within this region. This information now provides the means to test candidate genes 
for involvement in human linear growth regulation. Up to date, the major problem in 
de fining the GCY gene was the lack of potential transcription units assigned to this portion 
of the human Y chromosome. Prior to this study, there were only two pseudogenes, 
RPS24P1 and ARSFP, that mapped to the GCY critical region (Sargent et al. 1999). 

By exon amplification we isolated 9 different exon trap clones, two of which were 
composed of two exons. Parallel sequencing efforts of the GCY region by the Human 
Genome Project allowed us to complete our catalog of potential transcription units in the 
GCY region. No Y-specific ESTs were assigned to the region. The Nix and Rummage 
software programs were used to analyze sequence data of completed BACs to predict 
potential genes in the sequence. We have identified 4 new genes/pseudogenes and 17 gene 
models. Of the 17 gene models, only five have homologies to the identified 
genes/pseudogenes. A gene model homologous to ADLY (cfl) was uniformly predicted by 
all gene-finding programs. Though, the probability given by various gene finding programs 
might be overestimated with regard to the gene model cfl. Very large exons, as present in 
ADLY, are less likely to be predicted correctly, but they are most unlikely to be completely 
missed. Consequently the tendency to classify actual pseudogenes as functional genes 
increases with the presence of large exons. The failure to trap exons of the putative ADLY 
transcription unit, albeit possessing correct splice sites, might be an intrinsic feature of 
Y-chromosomal sequences. Complete representation of the AZFc region in cosmid/Pl 
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clones used for exon-trapping experiments (Reijo et al. 1995) led to the detection of DAZ 
as the only gene out of a possible 8 genes/gene families located in this region 
(Kuroda-Kawaguchi et al. 2001). 

Surprisingly, we observed no concordance between the gene models and the exon trap 
clones. It is possible that exon amplification is dependent on the presence of functional 
splice sites in the genomic sequence whereas gene modeling is mainly based upon the 
in-phase hexamer measure (Rogic et al. 2001), a method determining the incidence of 
oligonucleotides of length six in a specific open reading frame. On the other hand, the 
prediction of correct splice sites is less important since such signal sensors have low 
information content and are usually degenerate. Consequently, the exon trap clones need 
not to be necessarily part of one of the predicted gene models, although a substantial 
fraction of the trapped exons (7/1 1) are composed of 75 to 200 nucleotides, a length range 
in which exons are most accurately predicted. Likewise, the putative exons assembled to a 
distinct gene model do not necessarily represent real exons. 

It is possible that the eventual number of genes in the GCY region is smaller since exon 
trap clones and/or gene models turn out to be part of the same transcripts or do not 
represent genes at all. Despite, the number of potential transcription units in the region, 
however, the search for the critical one might still be complicated by the fact that the 
phenotypic effect caused by mutation of the GCY locus is hard to be defined precisely. 
This makes it difficult to predict an expression profile, especially when the gene function is 
unknown. Since human linear growth is a multifactorial trait, growth failure is quite 
common.. Although at least nine growth-controlling genes have been identified up to now, 
only few cases present disease-causing mutations within those genes. Definition of the 
transcription units in the region should now facilitate mutation studies, especially since 
full-length genes/pseudogenes have been isolated. . 

Although reverse-transcribed polyA + -RNAs and cDNA libraries have been extensively 
screened, we have not detected any transcript specific to the Y. This raises the question 
whether our approach was suitable. To assess its usefulness we have verified the 
expression pattern of 20 genes known to be essential for bone development at GenePage 
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(http://genome-ww5.stanford.edu). At least double presence for each selected gene was 
warranted by our screening efforts. This corroborates the existence of an unusual gene with 
an extremely confined spatial and/or temporal expression partem. 

Evolutionary features as a clue to the GCY locus? 

To gain more insight into the molecular genesis of the GCY critical region, we used two 
methods. First, we validated the functional state of the genes/pseudogenes within the GCY 
region by comparing them with their direct and functional progenitors. All gene pairs 
showed Ks/Ka ratios of 1 to 2 rather indicating that the Y copy is a pseudogene. This result 
assigns the X-Y gene pairs to evolutionary stratum 4 which fits very well since all those 
gene pairs share a common evolutionary history. Only one gene pair out of this class, 
AMELX/Y, still encodes a functional X- and Y-copy (Salido et al. 1992). The Y-copy of 
KIAA1470 clearly could be classified as a pseudogene by comparing it with its functional 
progenitor on lp36. Second, we made use of large-scale sequence comparison in order to 
identify potential differences between the subintervals of the GCY region and their 
homologous counterparts in Xp22 and lq43. Neither subregions with a conservation level 
above the molecular environment nor small genomic fragments newly integrated into the 
GCY critical region could be detected. Furthermore, promoter prediction carried out 
simultaneously on homologous genomic sequences revealed no differences. This clearly 
excludes substantial rearrangements within the GCY critical region and lends support to a 
gene underlying male-specific regulatory mechanisms. 
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Table 1 Adult height comparison of patients and their siblings 



Case 


CVinntrv 

of origin 


Height of patient (cm) 
and standard deviation 
score 


National 
height . 
standard 
(cm) 


Heights of family 
members (cm) and 
standard deviation 
score 


#293 


U.S.A. 


157 (SDS -2.9) 
short 


176.9 
(SD 6.8) 


(F) 170 
(M) normal 
(B) normal 


Y0308 


U.S.A. 


165.5 (SDS -1.7) 

borderline 

(short?) 


176.9 
(SD 6.8) 


(F)170 
(M) 168 

(B) 188 (SDS +1.7) 
(S) 170 (SDS -0.4) 


JOLAR 


United 
Kingdom 


168 (SDS -1.0) 
normal 


174.7 
(SD 6.7) 


(F) normal 
(M) normal 
(B) normal 


#28 


Italy 


175 (SDS -0.3) 

• 

normal 


176.7 
(SD 6.5) 


(F) normal 
(M) normal 


#63 


Ethiopia 


170 (SDS +03) 
normal 


168.0 
(SD 7.4) 


(F) normal 
(M) normal 


#95 


Israel 


185 (SDS +1.4) 
normal 


175.6 
(SD6.8) 


(F) normal 
(M) normal 


T.M. 


Belgium 


182 (SDS +13) 
normal 


173.5 
(SD 6.7) 


(F) normal 
(M) normal 


#1947 


Germany 


175 (SDS -0.8) 
normal 


179.9 
(SD 6.4) 


(F) normal 
(M) normal 


#1972 


Germany 


181 (SDS +0.2) 
normal 


179.9 
(SD 6.4) 


175(F) 

165 (M) j 
172 (S) (SDS +1.0) 



Hie standard deviation score (SDS) was calculated based cm the equation: SDS = (X-M)/SD, where X is an 
individual's adult height and M and SD are the mean adult height and the ± 1 standard deviation of the normal 
population, respectively. . 
(M) mother, (F) felher, (S) sister, (B) brother, (NA) n^t available. 
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Table 2 Y-chromosomal STSs 



STS L c ft Primer Right Primer Product 



SKYl 


GGACATTTGGCTGCAGAGAT 


TGGCAATGCACTCTCATCAT 


255 


SKY2 


TCAGGACAGACAGGCTGCTA 


CCTGCCACTGAGCTCCTTAC 


-1700 


SKY3 


TTCTCXXJTCATCTTCCAAGC 


GCTTCCATCCATTAGCAAGG 


167 


SKY4 


CCTTTCATTCCATTCTCTTCCA 


CGCACTTTATGGACTGCAA 


111 


SKY5* 


CCCTCGTCCATTTCTTTTGA 


CCTCGAATTTAATGGATTGC 


202 


SKY6* 


TCAATGGATGCACAGTGTGGC 


TCCACTGAATTCCATTGCAC 


328 


SKY7 


GGGAGTGCAAAGGOAAAGAT 


CTTTCCATGGGGTGACATTC 


223 


SKY8 


CCATTCATTCGAGTTCATTACG 


ATTGGAATGGAATCGGACAG 


189 


SKY9 


GGCCGATGGTCAAACTGTTA 


GAAACGGGCTCTGAAATTCT 


531 


SKY10* 


ATAAGGGGCAGGTTTGTCAC 


GCTACTTATTCAGTGTTTAACTGACAC 


329 


SKY11* 


AAAGTGGGTGAAGGACATGG 


TTTTTGTTTGTGGCAGGTO 


469 


SKY12* 


TTGAGTCACTGGGGATAACTG 


TATGGCCCACAATCACTTCA 


216 


SKY13* 


GGCAGCCTAGAAAGTCTTGTTC 


CCCTTGGGATTTTGTCTGTT 


198 



Markers indicated with a * amplify DNA fragments from more man one genomic tocos (see Chapter Restriction analysis o/PCR 
products for detail). 
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Table 4 Summary ofJJAC^and PAC clones identified daring physical map creation 



Y-STSs Positive BACs (RPCI1 1) Positive PACs (RPCI1, 3-5) 



sY83 
sY82 
GY8 

sY81 

14A3C* 

sY79 

SKY1 
SKY2 

SKY4 
SKY5 

SKY6 



not screened 
not screened 
not screened 

not screened 

not 



75F05, 79B14, 102G24, 322K23, 
417D23, 600D11, 612E10, 725112, 
863108, 903M02, 1 125H21 

376B16, 544C1 1, 544M21 
79P12, 295P22, 376L20, 828Q24, 
886111, 910C06 

75F05, 322K23, 612E10 
174124, 271E18, 295P22, 588E18, 
620X20, 632F1 1, 684H19, 705019 
174124, 271E18, 295P22, 588E18, 
620J20, 632FU, 684H19, 7&5019 



83D22 

83D22, 114A11, 157G08, 966C15 
114A11, 168E21, 271D03, 635F21, 
765H16, 806O15, 904E13, 966C15 
301P22, 1079J08, 1078C20, 
1160A12 

148J07, 1136A14, U60A12, 

1196123 

1149HH 



56A05, 85D24, 958M03 
829H08 

not screened 
not screened 

not screened 



* 14A3C is a hybridization probe previously described by Tyler-Smith et aL 1993. It detects a Y-specific 
HindlH-fragraent of 3.5 kb and an additional autosomal fragment 



WO 03/091381 




PCT/EP03/04546 



1 



I 

B 



09 



i 

i 
1 

.a 

WD 



§ 

1 

§ 
1 

I- 



i 
i 

-a 

H 



o 
m 
«o 
CM 

ON 

to 



io 
oo 

cm 

ON 



CM 
ON 



Pi 



3 

i— « 
OO 
IT* 



VO 



oo 
to 



to 
cm 



On 
to 

co 



vo 
co 



ON 

o 



o 



CN 

o 
o 
vo 



oo 

CM 



oo 
oo 



to 

3 



to 



ON 



ON 



ON 



m 
ro 
oo 
o 

CM 
I 

cm 
to 
oo 

8 



o 
to 



co 



g 

ON 
I 

ON 

s 



oo 
cm 
to 



o\ 
o 
to 



o 

CO 

to 



to 



vo 



tO 
CM 



oo 

ON 

to 



m 
i 

vo 
o 



ON 
OO 
ON 
tO 
I 

*— * 

oo 
o> 
to 



cm 
so 

I 

oo 

CM 
CN 



to 



vo 



oo 

On 
a 

ON 

»o 
r- 
vo 
OS 



oo 

CM 



CM 



ON 



CN 



CM 



oo 

CM 
CM 



1 



9 



< 2 



« 



c^ 
U 



ON 



CM 

6 



« 8 



to 

CM 



to 

CM 



CM 



CM 



to 
CM 



S 

CM 



*o 



to 

ON 



3 



5 



CM 
VO 



in 



i 



I 
I 

i 
i 



1 



2 



I 



I 



S 



1 



I 



I 

i 



8 



1 
3 



s 



.1 



i 



3 



i 



2 



l 

g 



g 



1 



5 



in 



1 



1 



! 



1 



| 



2 



WO 03/091381 




PCI7EP03/04546 



i 
J 



i 
1 



NO 



CO 

S 

Q 



f 
I 



t. 

i 

CO 

i 



i 

eh 
Eh 
CJ 

i 

E-J 



i 

E-i 
O 
CD 
CD 
E-» 
Eh 

u 

CD 



GO 



Eh 
CD 
O 
Eh 
Eh 
O 
CD 
Eh 
U 
Eh 
Eh 

1 



3 



s 



5 



o 

Eh 
Eh 
Eh 
Eh 
EH 

6 
S 

o 

Eh 
U 
O 



g 



g 

8 



0 



8 



to 

OS 



1 



s 



B 

EH 
EH 

O 

EH 

s 

i 

c_> 

Eh 
O 
Eh 



oo 
oo 



a 

CD 

1 

EH 
EH 

o 

O 
O 

l 

EH 

I 

1 

! 

CJ 
Eh 

I 
g 

1 

cd 

s 



CD Eh 
CD CD 

f=C — CD 

Eh x-i CD 

Eh CJ § 

CD O CD 

Eh X 

CD <D CJ 

CD — CD 
Eh Eh 

Eh Eh CD 



1 



OS 



Eh 



CD 

I 



Eh 

B 

Eh 

i 




s 



CD 
CD 
Eh 
O 
Eh 



CD 

6 

CD 
Eh 

I 

8 



S 

EH 
EH 

CD 
CD 
EH 
Eh 
O 



WO 03/091381 



PCT/EP03/04546 



| 

1 




WO 03/091381 




PCT/EP03/04546 



31 



Table 7B Primer pairs for Y copy of Adlican 



Primersequence (5'-» 3*) 


Direction with respect to putative 
transcription orientation 


primer 


GACTCCTGGCCTTGACTTGA 


ll/l WolU 




AdIYExl 


TCTCTGTGGTGCTGATCCTG 


forward 




cfl 


GGAGGAGCAAAAACAAGAAGAGA 


forward 




cfl-117 


ACTGATGAGCACGGGAACC 


forward 




cfl-205 


TCCATCCTGAAAGTGcrrro 


forward 




C17c 


ACATGTATACATGCTGCCAA 


forward 




C18 


CAGCGAAGG A A AGP A C* A TTT 


forward 




AdIYExS 


GGCGACCTG A AGGGG A CT 


forward 




cfl-1915 




forward 




C21 


GAAGCATCCACCAAAGCG 


forward 




cfl -4679 


ACAGCGfiOTTtPT A TO A GT 


forward 




cfl -4a 








CAGGATCAGCACCACAGAGA 


reverse 




AdlYEx2 


CTGGGGAAGTTGGATTTTCTC 


reverse 




C17b 


ACCAGGTTCCCGTGCTCA 


reverse 




cfl-227 


GCAAGAATCTGGGCTCTCAC 


reverse 




cfl 


ACTGTGATTCCCACCGTGAT 


reverse 




C17c 


TTGTTTTGAGGAACGCCTCT 


reverse 




C18 


GGATGTGGGATCTGGTGAG 


reverse 




cfl-2079 


GGGTGTAATTTTCTCCCATTG 


reverse 




AdlYEx5 


CGTCCGTTTCAGCAGTGACA 


reverse 




cfl -4810 


CTGACGTCCGTCCTCTGC 


reverse 




cfl-4b 


ATGGACAGTGATCCGGTTTC 


reverse 




cfl-6453 


TGAGCTGCACGATCAACCTC 


reverse 




cfl-6559 
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